Spatio-Temporal Hough Forest for efficient detection-localisation-recognition of fingerwriting in egocentric camera

نویسندگان

  • Hyung Jin Chang
  • Guillermo Garcia-Hernando
  • Danhang Tang
  • Tae-Kyun Kim
چکیده

Recognising fingerwriting in mid-air is a useful input tool for wearable egocentric camera. In this paper we propose a novel framework to this purpose. Specifically, our method first detects a writing hand posture and locates the position of index fingertip in each frame. From the trajectory of the fingertip, the written character is localised and recognised simultaneously. To achieve this challenging task, we first present a contour-based view independent hand posture descriptor extracted with a novel signature function. The proposed descriptor serves both posture recognition and fingertip detection. As to recognising characters from trajectories, we propose Spatio-Temporal Hough Forest that takes sequential data as input and perform regression on both spatial and temporal domain. Therefore our method can perform character recognition and localisation simultaneously. To establish our contributions, a new handwritingin-mid-air dataset with labels for postures, fingertips and character locations is proposed. We design and conduct experiments of posture estimation, fingertip detection, character recognition and localisation. In all experiments our method demonstrates superior accuracy and robustness compared to prior arts. © 2016 Elsevier Inc. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Object-Centric Spatio-Temporal Pyramids for Egocentric Activity Recognition

Activities in egocentric video are largely defined by the objects with which the camera wearer interacts, making representations that summarize the objects in view quite informative. Beyond simply recording how frequently each object occurs in a single histogram, spatio-temporal binning approaches can capture the objects’ relative layout and ordering. However, existing methods use hand-crafted ...

متن کامل

A Spatio-Temporal Model for Forest Fire Detection Using HJ-IRS Satellite Data

Fire detection based on multi-temporal remote sensing data is an active research field. However, multi-temporal detection processes are usually complicated because of the spatial and temporal variability of remote sensing imagery. This paper presents a spatio-temporal model (STM) based forest fire detection method that uses multiple images of the inspected scene. In STM, the strong correlation ...

متن کامل

Large scale continuous visual event recognition using max-margin Hough transformation framework

In this paper we propose a novel method for continuous visual event recognition (CVER) on a large scale video dataset using max-margin Hough transformation framework. Due to high scalability, diverse real environmental state and wide scene variability direct application of action recognition/detection methods such as spatio-temporal interest point (STIP)-local feature based technique, on the wh...

متن کامل

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

Robust and efficient models for action recognition and localization. (Modèles robustes et efficaces pour la reconnaissance d'action et leur localisation)

This thesis addresses the problem of action recognition, i.e ., how to determine the type of action that is happening in a video and its temporal localization. First, we consider the problem of video representation—how to encode videos in a robust way, such that the representation is suitable for a wide variety of action classes, tasks and video types. We present an extensive evaluation study t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Vision and Image Understanding

دوره 148  شماره 

صفحات  -

تاریخ انتشار 2016